Predictive Q-routing: a Memory-based Reinforcement Learning Approach to Adaptive Traffic Control
Abstract
In this paper, we propose a memory-based Q-learning algorithm called predictive Q-routing (PQ-routing) for adaptive traffic control. We attempt to address two problems encountered in Q-routing (Boyan & Littman, 1994), namely, the inability to fine-tune routing policies under low network load and the inability to learn new optimal policies under decreasing load conditions. Unlike other memory-based reinforcement learning algorithms in which memory is used to keep past experiences to increase learning speed, PQ-routing keeps the best experiences learned and reuses them by predicting the traffic trend. The effectiveness of PQ-routing has been verified under various network topologies and traffic conditions. Simulation results show that PQ-routing is superior to Q-routing in terms of both learning speed and adaptability.
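For orientation, the baseline that the abstract compares against can be sketched as follows. This is a minimal illustration of the standard Q-routing update (Boyan & Littman, 1994) at a single node, not the PQ-routing algorithm itself, whose details are not given in the abstract. The class name `QRouter`, its method names, and the default learning rate are assumptions made for this sketch.

```python
from collections import defaultdict


class QRouter:
    """Baseline Q-routing table for one node (illustrative; not PQ-routing)."""

    def __init__(self, neighbors, learning_rate=0.5, initial_estimate=0.0):
        self.neighbors = list(neighbors)
        self.learning_rate = learning_rate  # assumed default, chosen for the sketch
        # q[dest][neighbor] = estimated delivery time to dest via that neighbor
        self.q = defaultdict(
            lambda: {y: initial_estimate for y in self.neighbors}
        )

    def best_estimate(self, dest):
        """Smallest estimated delivery time to dest over all neighbors."""
        return min(self.q[dest].values())

    def choose_next_hop(self, dest):
        """Greedy policy: forward to the neighbor with the lowest estimate."""
        row = self.q[dest]
        return min(row, key=row.get)

    def update(self, dest, neighbor, queue_delay, transmission_delay,
               neighbor_estimate):
        """Move the estimate toward queue delay + transmission delay
        + the neighbor's own best estimate for dest."""
        old = self.q[dest][neighbor]
        target = queue_delay + transmission_delay + neighbor_estimate
        self.q[dest][neighbor] = old + self.learning_rate * (target - old)
        return self.q[dest][neighbor]
```

In this sketch an estimate changes only when a packet is actually sent through the corresponding neighbor, and the policy is purely greedy, so a route that once looked slow is rarely tried again. That is one way to read the decreasing-load problem named in the abstract.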
Similar resources
Predictive Q-Routing: A Memory-based Reinforcement Learning Approach to Adaptive Traffic Control
In this paper, we propose a memory-based Q-learning algorithm called predictive Q-routing (PQ-routing) for adaptive traffic control. We attempt to address two problems encountered in Q-routing (Boyan & Littman, 1994), namely, the inability to fine-tune routing policies under low network load and the inability to learn new optimal policies under decreasing load conditions. Unlike other memory-ba...
Multicast Routing in Wireless Sensor Networks: A Distributed Reinforcement Learning Approach
Wireless Sensor Networks (WSNs) consist of independent distributed sensors with storage, processing, sensing, and communication capabilities to monitor physical or environmental conditions. WSNs pose a number of challenges because of limited battery power, communication, computation, and storage space. In recent years, computational intelligence approaches such as evolutionar...
Mini/Micro-Grid Adaptive Voltage and Frequency Stability Enhancement Using Q-learning Mechanism
This paper develops an adaptive method for controlling the frequency and voltage of an islanded mini/micro grid (M/µG) using reinforcement learning. Reinforcement learning (RL) is a branch of machine learning and the main solution method for Markov decision processes (MDPs). Among the several solution methods of RL, the Q-learning method is used for solving RL in th...
Reinforcement Learning Based PID Control of Wind Energy Conversion Systems
In this paper an adaptive PID controller for Wind Energy Conversion Systems (WECS) has been developed. The adaptation technique applied to this controller is based on Reinforcement Learning (RL) theory. Nonlinear characteristics of wind variations as plant input, wind turbine structure and generator operational behavior demand a high-quality adaptive controller to ensure both robust stability an...
Confidence Based Dual Reinforcement Q-routing: an Adaptive Online Network Routing Algorithm
This paper describes and evaluates the Confidence-based Dual Reinforcement Q-Routing algorithm (CDRQ-Routing) for adaptive packet routing in communication networks. CDRQ-Routing is based on an application of the Q-learning framework to network routing, as first proposed by Littman and Boyan (1993). The main contribution of CDRQ-routing is an increased quantity and an improved quality of explorati...
Publication date: 1996